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Abstract. We show that asymptotic equivalence, in a strong form, 
holds between two random graph models with slightly differing edge 
probabilities under substantially weaker conditions than what might 
naively be expected. 

One application is a simple proof of a recent result by van den Esker, 
van der Hofstad and Hooghiemstra on the equivalence between graph 
distances for some random graph models. 



1. Introduction 

There are many different models of random graphs. Sometimes, the dif- 
ferences are minor, and it can be guessed that the asymptotic behaviour of 
two models are the same (for all or at least for some interesting properties). 
This note concerns some cases where it is possible to actually prove such 
results in a strong form. We begin by defining the two types of asymptotic 
equality that we will study. All unspecified limits are as n — > oo. 

Definition 1.1. Let (X n ,A n ), n > 1, be a sequence of arbitrary measurable 
spaces and let P n and Q n be two probability measures on (X n ,A n ). 

(i) The sequence (P n )n is asymptotically equivalent to (Q n )m denoted 
by (P n ) n — (Qn)n, if for every sequence of measurable sets A n (i.e., 
A n G A n ), we have P n (A n ) - Q n (A n ) -> 0. 

(ii) The sequence (P n )n is contiguous with respect to (Q n ) n , denoted 
by (P n ) n < (Qn)m if for every sequence of measurable sets A n such 
that Q n (A n ) — > 0, we also have P n (A n ) — > 0. 

We use the same terminology and notations for sequences of random vari- 
ables X n and Y n with values in the same space X n , meaning that these 
properties hold for their distributions C(X n ) and C(Y n ). For example, 
(X n ) n = {Yn)n means that f(X n G A n ) — F(Y n £ A n ) — » for every sequence 
(A n ) n . We will also use the simpler notations X n = Y n and X n <\ Y n , etc. 

Note that asymptotic equivalence is a symmetric relation while contiguity 
is not; we say that (P n )n and (Q n )n are (mutually) contiguous, (P n )n O 

if both (P n ) n < (Q n )n and (Q n ) n < (P n )n, i-e., if P n (A n ) — > 
Q n (A n ) — > for any sequence of measurable sets A n C X n . (And similarly 
for sequences of random variables X n and Y n .) 
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Asymptotic equivalence implies contiguity, but not conversely (see e.g. 
Example 11.21 and Remark ll.6j) , so contiguity is a weaker property. 
We illustrate these notions by two simple examples. 

Example 1.2. In the special case of two constant sequences, P n = P and 
Qn = Q where P and Q are two probability measures defined on the same 
space (X n ,A n ) = (X;A), (P n )n = (Qn)n if and only P = Q, and (P n ) n < 
(Qn)n if and only if P <C Q, i.e., P is absolutely continuous with respect 
to Q. Hence asymptotic equivalence and contiguity can be thought of as 
asymptotic versions of equality and absolute continuity, respectively. 

Example 1.3. Let X n be random elements in some spaces X n and let £ n be 
events that depend on X n only, i.e., £ n = {X n G E n } for some (measurable) 
sets E n C X n , and suppose that liminf P(£ n ) > 0. Let Y n := (X n \ £ n ) 
be X n conditioned on £ n (possibly ignoring some small n with P(£ n ) = 0). 
Then, for any A n , and some C < oo, 

F(Y n G A.) = F{Xn ^j n nEn) < CF(X n G A n n E n ) < CF(X n G A n ) 

i) 



and thus (Y n ) n < (X n ) n 

An important random graph example of this is when Y n is a random graph 
with a given degree sequence d±, . . . ,d n , uniformly chosen among all such 
graphs, and X n is the random multigraph constructed by the configuration 

model (see, e.g., Bollobas 0]); then Y n = (X n \ X n is a simple graph) so 
(Y n ) n < (X n ) n provided liminf n _ voo P(X n is simple) > 0. This is the case 



when Yli=i di — ► oo and Y17=i = ^\Ya=1 ^»)> see Janson 13(] (with sev- 
eral earlier partial results by various authors), which makes it possible to 
transfer many results from X n to Y n . Indeed, this is a standard method 
to study random graphs with a given degree sequence, and in particular 



random regular graphs, see e.g., [3], [14j |. [1 



Remark 1.4. Suppose that X n = Y n . If F(X n G A n ) — > a for some 
sequence of (measurable) sets A n and some a G [0, 1], then F(Y n G A n ) — > a 
too. Hence, any result for X n that can be stated in terms of convergence 
of some probabilities holds for Y n too; for example, this includes any result 

of the type ip n (X n ) » a and ip n {X n ) W for some functional ip n : 
X n — * R (and a number a or a random variable VF). However, results that 
are sensitive to events with small probabilities, such as moment convergence 
or large deviation estimates, do not transfer automatically. For example, if 
X n = Y n and we know that E (p n (X n ) — * a, we may guess that E (p n (Y n ) — > a 
too, but we cannot conclude it without further information (for example 
uniform integr ability of ip n {X n ) and ip n (Y n )). 

If instead only X n > Y n , then results of the type (p n (X n ) — > a still 

transfer to Y n , but not result on convergence in distribution. (If ip n (X n ) -—* 
W, then the sequence cp n (Y n ) is tight, but does not have to converge to W, 
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or at all. Typically, <p(Y n ) — > W' for some W ^ W, see e.g. Example 11.21 
and several examples of cycle counts in [3, Chapter 9].) 

Suppose now that G n and G' n are random graphs on the vertex set [n] := 
{1, . . . , n}. By the standard Theorem 14.21 below. G n = G' n if and only if it is 
possible to couple G n and G' n , i.e., to define them simultaneously on some 
probability space, such that P(G n ^ G' n ) — > 0. (We assume that we are 
interested only in the distributions of G n and G' n , so we may replace them 
by any random graphs with the same distributions.) 

In particular, we will study random graphs of the following type. If p^, 
1 < i < j < n, are given probabilities in [0,1], let G{n, {pij}) be the random 
graph on [n] where the edge ij appears with probability p^ and the indicators 
Iij := l[edge ij appears], 1 < i < j < n, are independent. (We will later 
also consider an extension to random Section [2j) 

Consider two sequences of such graphs, defined by probabilities {Pij}i<i<j< 
and {p'ij}i<i<j<n', Pij and pij may depend on n too, but to simplify the no- 
tation we do not show this explicitly. It is obvious that we may couple 
the edge indicators 1^ and Z[ ■ of ij in G(n, {pij}) and G(n, {p[j}) such that 
F(Iij ^ IL) = \pij —p'ij\, and by taking independent pairs (i^, J^ ) we obtain 
a coupling of the random graphs G(n, {pij}) and G(n, {p^}) with 

P(G(n, { Pij }) + G{n, {p^})) < ^ \ Pij - p^. (1.1) 

i<j 

Consequently, G(n,{pij}) = G(n, {p'^}) if Y,i<j\Pij ~ Pij\ -> 0; a simple 
fact that has been used by many authors. It may be believed that this is 
essentially best possible, but, somewhat surprisingly, this is not so. In fact, 
by Corollary 12.121 below, provided pij < 0.9, say, G(n,{pij}) = G(n, {pL}) 
if Yli<j(Pij ~ Pij) 2 /Pij ~^ 0- (Moreover, Theorem I2.2f i) shows that this is 
best possible if, for example, p'^ < 2pij.) 

For a particular case, suppose that p' i:j = Pij + Oipfj), Then Corollary EH 
shows that G(n, {pij}) = G(n, {p^}) if Yli<j P% ~^ 0' while (II. 1|) implies this 
only under the stronger condition Y^,i<j Pij ^ 0. A typical case where this 
is an important improvement is when all pij = Q(l/n) and |pL — p%j\ = 
0(l/n 2 ). See further the examples in Section [3l 

To see that such an improvement of (jl.ip might be possible at all, consider 
as an example the case when all p^ are the same, so we consider the random 
graph G(n,p): 

Example 1.5. Let p = p(n) and p' = p'{n) be given in [0, 1] and consider 
G(n,p) and G(n,p'). Let iV := (2) be the number of possible edges and let 
M ~ Bi(N,p) and M' ~ Bi(N,p') be the number of edges in G(n,p) and 
G(n,p'). The conditional distribution of G(n,p) given M = m is uniform 
over all graphs on [n] with m edges, and the conditional distribution of 
G(n,p') given M' = m is the same. It follows that any coupling of M 
and M' may be extended to a coupling of G(n,p) and G(n,p') such that 



4 



SVANTE JANSON 



G(n,p) = G(n,p') when M = M'\ as a consequence, using (14. 6p below, 
dTv(G(n,p),G(n,p')) = dry(M, M'), and in particular, as n — ► oo, using 
also Theorem 14.21 

G{n,p) = G(n,p') M^M'. 

Since M and M' have binomial distributions with the same n, P(M = 
k) I P(M' = k) is monotone in k, and it follows that the maximum of | P(M G 
A) — P(M' G j4)| over subsets ^4 of Z is attained for a set of the form 
[0,1 J"]- 

Now suppose that p — > and iV(p' — p)j \J Np — > a G [— oo, oo]. Suppose 
first that a is finite. By the central limit theorem, (M — Np) / y 1 Np — ► 
N(0, 1) and (M' - Np)/y r Np~ -^U N(a, 1), and it follows easily that 



d TV (G(n,p),G(n,p')) = d TV (M,M') = sup|P(M < j) - P(M' < j)\ 

j 

= sup | P(N(0, 1) < x) - P(N(a, l)<x)\+ o(l) 
-> *(a/2) - *(-a/2). 
It follows, using Theorem 14.21 again, that G(n,p) — G(n,p') if and only if 



a = 0, i.e. A^(p' — p) j yj Np — > 0, which is equivalent to X^o'G'' ~p) 2 /p 



N(p' — p) 2 /p — > 0. For example, if p = 1/n and p' = 1 — e 1//n = p — 
\n~ 2 + 0(n~ 3 ), then iV(p' - pf jp = 0(l/n), so G(n,p) = G(n,p'), but 
A^|p — p'\ — > 1/4, so (jl.ip is not enough to show this. 

We see that in this example, the trick to improve the simple and 'obvious' 
edgewise coupling used in (jl.ip . is to first ignore the positions of the edges 
and couple their numbers only; this then is extended to a coupling of the 
random graphs by randomly reinserting the positions. Corollary I2.12( i) 
shows that couplings improving the simple edgewise coupling exist also when 
the edge probabilities are unequal, but in that case we do not know any 
explicit construction of such couplings. 

We give the main results in Section [2] and a number of examples in Sec- 
tion [3j this includes an application to a recent result by van den Esker, van 
der Hofstad and Hooghiemstra (Example EH]). Proofs are given in Section [5j 
after some preliminaries in Section HI 

We use the standard notations o p and O p , see e.g. [13, Section 1.2], and 
we write whp (with high probability) for events with probability tending to 
1 as n — > oo. 

Remark 1.6. There are also interesting examples of contiguity among ran- 
dom graphs of other types than G(n,{pij}). In particular, several different 
constructions of random regular graphs (or multigraphs) are known to yield 
distributions that are (mutually) contiguous but not asymptotically equiva- 
lent, see e.g. 12], [13, Section 9.5], [l3|. These examples are not covered by 
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2. Results 

We defined above the random graph G(n, p), where p = {pij}i<i<j< n is 
a vector of probabilities. We extend the definition of G(n, p) to the case 
when p is a random vector (with entries in [0,1]) by conditioning on p, i.e., 
given p = {pij}, the edge indicators Iy are independent with ~ Be(pij). 
Random graphs of this type have been studied in many papers, see for 
example Bollobas, Janson and Riordan [4| and the further references given 
there. 

We now state our main results on asymptotically equivalent and contiguity 
of such random graphs. Actually, the results have nothing to do with the 
graph structure and the way the indicator variables are indexed by pairs 
ij. It therefore seems more natural to consider the more general situation 
of a (finite or infinite) sequence (Ii)^ of indicator variables. (An indicator 
variable is a random variable with values in {0, 1}, i.e. a random variable 
with a Bernoulli distribution Be(p) for some p E [0,1].) The results for 
random graphs then follow by relabelling the indicators. 

We define a function p : [0, l] 2 — > [0, oo) in Definition 12.11 where we also 
give some equivalent (within constant factors) alternative formulas that of- 
ten are more convenient. Since the results below are not affected by chang- 
ing p within constant factors, we could use any of these alternative formulas 
(and several other similar ones) as our definition. (The motivation for the 
definition comes in Lemma 14.31 ) 

We write x x y (where x, y > 0) to denote that cy < x < Cy for some 
positive constants c, C, i.e., that x = Q(y) (or, equivalently, x = 0(y) and 
y = 0(x)). Further, we use x\/ y and x Ay for the maximum and minimum, 
respectively, of x and y. We interpret 0/0 as 0. 

Definition 2.1. Let 

pM ■= {Vp - Vq) 2 + {V^p - V^q) 2 (2-1) 

^>" g)2 + ip ~ q)2 (2.2) 



p+q 1 —p+ 1 

(P ~ g) 2 

(pVg)A((l-p)V(l-g)) 

- q) 2 



(2.3) 



p A (1 — p) 
In particular, if p < 0.9, then 



A\p-q\. (2.4) 



p(p,q)~^-^-A\p-q\. (2.5) 
P 

Of course, the constant 0.9 here and below is arbitrary and could be 
replaced by any number < 1. 
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Proof. The first equivalence follows from 

iVp + Vq) 2 p + q 

together with the similar result with 1 — p and 1 — q. The second follows 
from x + y x x V y for x, y > (used thrice). The third is equivalent to 

(pVg)A ((1 -p) V (1 - q)) X (p A (1 — p)) V |p - g|, (2.6) 

which is easily verified, for example by assuming (by the symmetry p i— ► 
1— p, g i— ► 1 — g) that p < 1/2, in which case (|2.6[) easily reduces to 
pVgxpVlp — q\. □ 

We state our results first for the simpler case of sequences of independent 
indicator variables with given (non-random) probabilities. The following 
theorem gives necessary and sufficient conditions for asymptotical equiva- 
lence and contiguity. (The asymptotical equivalence criterion follows by a 
simple and standard type of calculation with Hellinger distances, see the 
proof in Section [5] and, e.g., [H, p. 158], although we have not seen it stated 
in this form before. The contiguity criterion is a special case of a result by 
Oosterhoff and van Zwet [l|| for general sequences of independent variables.) 
The proofs of the theorems are given in Section [5j 

Theorem 2.2. Let 1 < N(n) < oo and let X n = (I n i)fjf and X' n = 

(^ni)i=i be finite or infinite random vectors consisting of independent indi- 
cator variables I n i ~ Be(p n j) and I' ni ~ Be(p^). 

(i) X n = X' n if and only if 

N(n) 

J2 P(P"iM - °- ( 2 - 7 ) 

i=l 

(ii) X n < X' n if and only if 

N(n) 

J2p(j>ni,Pni) = 0(l) (2.8) 
i=l 

and, with q ni := 1 - p ni and q' ni : = 1 - p' ni , 



lim limsup p ni + q ni \ =0. (2.9) 



n—*oo 



Remark 2.3. By symmetry, X n D> X' n is equivalent to (|2.8p and 



n— »oo 




lim limsup V j/ ni + V 4 =0, (2.10) 



and thus X„ <> is equivalent to flggp , (|2l)j) and (|2TT0j) . 
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Remark 2.4. Often p n i < 0.9 for all n and i, and then the second sum in 
(12.91) vanishes for C > 10 and can thus be omitted. 



Remark 2.5. The condition (j2.9h is only needed to take care of cases when 
p n i and p' ni (or q n i and q' ni , in case p n j and p' ni are close to 1) are not of the 
same order. If no such Pni 

and p' ni appear, which is the typical case, then 

(|2.8p is thus enough. 

We may rewrite (|2.9p in several ways. For example, it is equivalent to 
(following the formulation in [lit ] in a more general case) : for every sequence 

C n -> DO, 

P™ + Yl Qni^O- (2.11) 

i:Pni>C n p' ni i-qni>C„q' ni 

It is also equivalent to: For every e > 0, there exist C and no such that if 
n > uq, then 

^2 Pm < £ and ^ <j ni < e. (2.12) 

i:pni>Cp' ni i-q n i>Cq' ni 



Remark 2.6. As pointed out by Oosterhoff and van Zwet [la], (|2.8j) does not 
imply (|2.9p in general. A simple counter example is provided by N(n) = n, 
Pni = n -1 , p' ni = n~ 2 . (On the other hand, it is easy to see, and also follows 
by the theorem, that (J2ZJ) implies (J2J1 and (gUDJ) .) 

Remark 2.7. In the very special case when N(n), p n j and do not depend 
on n (and we omit the subscript n), it is easily shown that if < p\ < 1 for 
all i, then ([2T5]) implies ([2~^]) . and thus (A") < (A'), which by Example O 
says that the distribution of A is absolutely continuous with respect to the 
distribution of A'. If we further assume also < pi < 1, by symmetry the 
distributions are thus mutually absolutely continuous. This is part of the di- 



chotomy by Kakutani for product measures, see e.g. [ll|, Corollary IV. 2. 38], 
which in our case says that either ^ p(Pi,Pi) < oo and the distributions are 
mutually absolutely continuous, or ^ p(Pi,Pi) = oo and the distributions 
are mutually singular. 

Returning to the general case in Theorem 12.21 it is easy to show that, 
analoguously, if ^ p(p n i,p' ni ) — > oo, then the distributions of X n and X' n 
are asymptotically mutually singular in the sense that there exist sets A n 



with P(A n £ A n ) — > 1 and ¥{X' n G A n ) -> 0, cf. [ill, Theorem V.2.32] 



Remark 2.8. We have stated Theorem 12.21 in terms of sequences of pairs 
of random vectors. It is possible (at least partly) to rephrase it in terms of 
estimates for a single pair (A, A'), see Lemmas 15.11 and 15.21 below. Similar 
reformulations may be made for Theorem 12.91 but we leave these to the 
reader. 

We extend Theorem 12.21 to the case of random probabilities p n i. In this 
case we cannot expect conditions that are both necessary and sufficient, so 
we give only sufficient conditions, which are more important in applications. 
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(An important obstacle to finding necessary conditions is that different dis- 
tributions of the probabilities may give the same distribution of the indica- 
tors. For example, using the notation of Theorem 12.91 if p n i are i.i.d. with 

Vni ~ U(0, 1) and p' ni = 1/2, then X n = X' n .) 

Theorem 2.9. Let 1 < N(n) < oo and suppose that p n = {p n i} and 
Pn = {Pni\ are random vectors in [0, l] N ^ . Let X n = (I n i)i=i and 
Xn = (^ni)i=i be random vectors of indicator variables such that the condi- 
tioned random vectors (X n | p n ) and {X' n \ p' n ) are sequences of independent 
indicator variables with (L n i | p n ) ~ Be(p n i) and (J' ni | p' n ) ~ Be(p' ni ). 

(i) V 

N(n) 

^ p(Pni,Pni) = Op(l), (2.13) 



then X n = X' n 
(ii) V 



i=i 



N(n) 

Y J P(PnuPni)=O p {l) (2.14) 
i=l 



and, with q ni : = 1 - p ni and q' ni := 1 - p' ni , for every e > 0, 



lim limsupP V p ni + V q ni > e = 0, (2.15) 

\i:p ni >Cp' ni i:q ni >Cq' ni / 

then X n < J 5 ^. 

Remark 2.10. Recall that if S n denotes the random sum on the left-hand 
side of (I2.13p . then (|2.13j) can also be written S n — —* 0. Similarly, the O p (l) 
notation in (|2. 14[) means that for every e > 0, there exists C such that 
P(S' n > C) < e for all n; this is also known as stochastic boundedness or 
tightness of the sequence {S n }, and is equivalent to P(S n > C n ) — » for 
every sequence C n — » oo. 

Remark 2.11. In analogy to (12. Ill) , the condition (|2.15p is equivalent to: 
For every sequence C n — > oo, 

P™ + S <7™^0. (2.16) 

As said above, Theorems 12.21 and 12 . 91 apply immediately to random graphs 
G(n,p). We state a version of Theorem 12.91 for this case, where we have 
added some simplifying assumptions. Recall that pij and p'^ may (and 
typically do) depend on n, although we do not show that in our notation. 

Corollary 2.12. Let, for each n, p = {pij} and p' = {p'ij} be random 
vectors of probabilities and suppose that whp xaaxijpij < 0.9. 
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(i) // 



then G{n,p) = G(n,p'). 

(ii) If 

=O p (l), (2.18) 

then G(n,p) > G(n,p'). 

(iii) If ()2.18p holds, and further, for some constant c > 0, whp cpij < 
Pij < 0.9 /or a// i,j, then G(n,p) <> G(n, p'). 

We specialize further to an important case. 

Corollary 2.13. Let, for each n, p = {pij} and p' = {p[j} be random 
vectors of probabilities and suppose that p'^ = p^ + 0(p?). 

W V EiKjPij = Op(l), then G(n,p) * G(n,p'). 

(ii) If Yli<jPij = P {1), and further, for some constant c > 0, whp 
m&Xij p^ < 0.9, m&Xijp'- < 0.9 and p'- > cp^ for all i,j, then 
G(n,p) <> G(n,p'). 

3. Examples 

Example 3.1. Bollobas, Janson and Riordan [J] study a general class of 
sparse random graphs G(n, k) which include many cases studied earlier by 
various authors. These random graphs are defined as G(n, {pij}) with 

(!) • f K ( x i, x j) A » A ro-,^ 

Pij : = Pij : = mm { — S 1 ) = Pij A h I 3 - 1 ) 

with 

n 

where k : S x S — ► [0, oo) is a symmetric measurable function defined on 
some measurable space S and random sequence of elements of 

5, not necessarily i.i.d. but such that the empirical distribution of xi, . . . , x n 
converges to a probability measure /i on 5; see [H for details. (Some further 
technical conditions are assumed in [4(; they are not relevant here.) Typi- 
cally, whp n(xi,Xj) < n for all i, j, and then p^j equals the simpler p^. Two 
natural variations, also treated in 0] and used in various cases by various 
authors, are obtained by replacing (|3.ip by 

Pi/ ■= 1 - exp(^ — ^-J = 1 - exp{-pij) (3.3) 

or 



J3) ._ K(Xj,Xj) _ pjj 

n + n(xxj) 1 + p^ 



10 



SVANTE JANSON 



(Thus pfj /(l — Pif) = Pij', at least in the case studied in Example 13,51 
below, this is in some sense simpler and more natural, see Britton, Deijfen 
and Martin-L6f [fj].) In all cases pf) = pij + 0(pfj), which is the essential 
estimate for our purposes; the results below extend to the general case 

Pij := h(pij) for a function h with h(x) = x + 0(x 2 ). (3.5) 

It was shown in 0] that the same asymptotic results hold for these three 
versions for the properties studied there. We can now show that, under 
an extra condition, the three versions are asymptotically equivalent, and 
thus have the same asymptotic behaviour for any property. Indeed, Corol- 
lary 12.131 applies immediately and shows that if 

^ n(xi,Xj) 3 = o p (n 3 ), (3.6) 

l<i<j<n 

then all three G{n,pf^) : t = 1,2,3, are asymptotically equivalent; similarly, 
if the weaker 

<Xi, Xj f = O p (n 3 ) (3.7) 

l<i<j<n 

holds together with maxijpij < 0.9 whp, then all three G{n,pf^) are mu- 
tually contiguous. 

In fact, (13.7(1 alone suffices for G{n,p\^ ) <> G{n,p\^ ) because (I3.7P im- 
plies maxj .j pij = Op(l) so by conditioning we may assume that maxjj pij 

C\ for some constant C\, and then, for I = 2,3, p\- < C2 < 1 and 

(£) 

cpij < p\- < Pij- Furthermore, by the same conditioning and Corol- 
lary EHii), ([3ZD implies G{n,pf)) > G{n,pf)) and G{n,pf)) > G{n,pf)). 
However, if, for example k(x\,X2) > n whp, then I\2 = 1 whp in G(n,p\^) 

but not in G(n,p^- ) or G(n,plj ), and we do not have contiguity in the 
opposite direction. 

We study some special cases in the following examples. 

Example 3.2. One common case of the construction in Example 13.11 uses 
xi, . . . , x n that are i.i.d. on S with distribution /x. In this case, we show that 
the condition 

fi x p{(x,y) : k(x,u) > t)} = o(t~ 2 ) as t —* 00 (3-8) 

implies (I3.6P and thus asymptotic equivalence of the three versions. In par- 
ticular, this holds if f SxS k(x, y) 2 dfi(x) d^i(y) < 00. 

In fact, if G(t) := fi x /j,{(x,y) : n(x,y) > t)} = o(t~ 2 ), then 

E(/t(xi, X2) A n) 3 = / (ft(x,y) A n) 3 dfj,(x) dfi(y) 
JSxS 

r 1 

M 2 G(t) dt = n 3(ns) 2 G(ns) ds = o(n) (3.9) 
Jo 
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by (13. 8p and dominated convergence. Hence, E Yli<j( K ( x ii x j) Are) 3 = o(n 3 ), 
so J2i<j( K (. x i> x j) ^ n ) 3 = °p( n3 )- Moreover, 

l<i<j<n l<i<j<n 

< P(/«(xi,Sj) > n) < n 2 G(n) = o(l), 

l<i<j'<n 

and (|3.6p follows. 

Similarly, we can easily shown that (13. 7j) . and thus at least partial conti- 
guity, follows from 

\i x fi{(x,y) : n(x,y) > t)} = 0{t~ 2 ) as t — ► oo. (3.10) 
In this case, given e > 0, there exists C\ such that 

P( Y « x i> x j) ACin) 3 ^ ^ ^(xi,^) 3 ) < e; 

l<i<j<n l<i<,j<n 

further, a calculation as in (|3.9|) yields E J2i<i<j<n( K ( x i> x j)^Cin) 3 = 0(n 3 ) 

and thus P(Xa<i<j<n( At ( a; *> ^i) ^ ^i n ) 3 > C^n 3 ) < e for some C2; we omit 
the details. 

Example 3.3. Another case of the construction in Example 13.11 uses S = 
(0,1] with fj, = Lebesgue measure and the deterministic Xi = i/n, i = 
1, . . . , n. The homogeneous case k(x, y) = c/(x V y) yielding pij = c/(i V j), 
where c > is a constant, is particularly interesting and related to the 
CHKNS model, see Bollobas, Janson and Riordan 0, Sections 16.1], Durrett 
@, [I] and Riordan [TtJ and the references given there. 

In this case, £i<i<j<oo P% < c 3 J2j>2 3 ' i~ 3 < °°> and thus T.i<jP% = 
0(1); if we further for simplicity assume c < 2 and thus maxy jjy < 1, then 

Corollary El3](iii) implies that G(n,p\f) <> G{n,pf)) <t> G(n,p\f). 

Note that in this case, p^n }, P12 an d P12 are constant and different, which 
shows that the three random graphs are not asymptotically equivalent (for 
a trivial reason). 

We have p = c/(iVj + c); the same results hold for the further variation 
Pij = cj{% V j + d) for any d > c — 2. 

In this example, the infinite random graphs G{oo,pf^), I = 1,2,3, are 
well-defined too, and it follows from Kakutani's criterion discussed in Re- 
mark [22] that (still provided c < 2) these three infinite random graphs have 
mutually absolutely continuous distributions, which is the infinite graph ver- 
sion of the contiguity result just given for finite n, cf. Example 11.21 (The 
infinite random graph G(oo,p-j) was studied before the finite version, see 
[H] and [3], [1] with further references.) 

Example 3.4. A related case uses the same S = (0, 1], p = Lebesgue mea- 
sure and Xi = i/n, i = 1, . . . , n, as Example 13.31 now with the homogeneous 
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n(x,y) = c/^/xy yielding = c/y/ij; this case is a mean-field version of 
the preferential attachment model by Barabasi and Albert [l|, see Bollobas, 
Janson and Riordan 0, 16.2] and Riordan 17] and the references given there. 



Also in this case, Y^x<i<j<ooP% < °°> an d tnus ^2i<jPij = 0(1) (in 
spite of the fact that (|3.10p does not hold); if we further for simplicity 
assume c < y2, and thus maxyjPjj < 1, we obtain the same results as in 
Example [ 



Example 3.5. A common case of Example 13.11 is when n(x,y) = ij)(x)ip(y) 
for some function : S ^ [0,oo), see [J, Section 16.4] for discussion and 
references to previous papers. 

In this case, Yl%<j K { x i: x j) 3 — (Si VK 3 ^) 3 ) > so (|3.6p and (|3.7p may be 
replaced by 

n 

;>>(*,) 3 = o P (n 3/2 ), (3.11) 
i=l 

and 

n 

J>(^) 3 = O p (n 3 / 2 ). (3.12) 

i=l 

If we combine this choice of k with the i.i.d. choice of Xi in Example 13.2^ 
it is easily seen, arguing as in (|3,9p but now with ^(^(^i) A n 1//2 ), that 

fi{x : Tp(x) > t)} = o(t~ 2 ) as i ^ oo (3.13) 

implies ()3.6p and thus asymptotic equivalence of the three versions; in par- 
ticular this holds if J ^(x) 2 dfj,(x) < oo. Similarly, 

n{x : if){x) > t)} = 0(t~ 2 ) ast^oo (3.14) 

implies (|3.7|) and thus at least partial contiguity. 

Example 3.6. van den Esker, van der Hofstad and Hooghiemstra study 
a minor variation of the construction in Example 13.51 they let Ai, . . . , A n be 
positive i.i.d. random variables with some fixed distribution and define pij 
by (in our notation) (|3.ip . (|3.3p . (|3.4p or more generally (|3.5j) with 

AjA ? - . 

Pij ■= \ ■ (3-15) 

(This too can be seen as an instance of the general construction in Exam- 
ple EH see 0, Section 16.4].) 

Assume that P(Ai > t) = o(t~ 2 ) (which is the case in 0]). Then, just as 
(I3TTTD follows from ([3TT5D . ££Af = o p (n 3 / 2 ). Since further YX A i/ n 
E A > by the law of large numbers, it follows from (|3.15p that Yli<j Pij ~ 
0. Hence, if we compare two random graphs G(n, p) and G(n, p') defined 
by this method for two different functions h and h', we obtain G(n,p) = 
G(n,p') by Corollary EH 
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van den Esker, van der Hofstad and Hooghiemstra [9J study the distance 
H n between two random points, and (a minor) part of their proof consists 
in showing that the choice of h does not matter (enabling them to consider 
only the version f)3. 3f) in the main part of the proof) : the variables H n and 
H' n obtained by two different functions h and h! in (|3.5p can be coupled such 
that ¥{H n ^ H' n ) = o(l), or in our notation H n = H' n , see Theorem 14.21 
We thus obtain this as an immediate consequence of the stronger statement 
G(n,p) = G(n,p'), which by Theorem 14.21 means that the random graphs 
can be coupled with P(G(ra,p) / G(n,p')) 0. 

Example 3.7. Our results are stated for graphs with a deterministic num- 
ber of vertices, but can be extended to graphs with random vertex set too 
by conditioning on the vertex set. One interesting such case is obtained 
from Example 13.11 by letting x%, . . . , x n be the points of a Poisson process 
on S with intensity Xfj,, where A > is our parameter and we consider 
asymptotics as A — * 00; thus n is random with the distribution Po(A). 

Conditioned on n, we have the situation in Example 13.21 It follows, 
for example, that if (|3.8p holds, then the random graphs defined in this 
way using (|3.ip . (|3.3p and (|3.4p are asymptotically equivalent; we omit the 
details. 

Example 3.8. Bollobas, Janson and Riordan [5j study a generalization of 
the model in Example 13.11 where small sets of edges are added at once, thus 
allowing a certain degree of clustering; more precisely, for every subgraph F 
of the complete graph K n , we have a certain probability of adding (the edges 
of) F, and these events are independent for different F. While this intro- 
duces dependencies between the edge indicators, the results of the present 
paper are still applicable to the sequence of indicators Ip describing the 
added sets of edges, and asymptotic equivalence or contiguity for two ver- 
sions of this sequence obviously implies asymptotic equivalence or contiguity 
for the resulting random graphs too. 

We leave the explicit statement of results in this case to the reader. 

Example 3.9. In this final example, let us return to the case of deterministic 
P = {Pij} an d let us change all py proportionately to := (1 + S n )pij for 
some S n . Assume for simplicity that all p^ < 0.9 and that \S n \ < 1. 

By Corollary EEfi), if ftjZiKjPij -» °> then G (n,p) = G(n,p'). Fur- 
ther, by Corollary 12 . 1 2f iii) , if 5% J2i<jPij = ^(1) and, for simplicity, S n — ► 0, 
then G(n,p) <I> G(n,p'). 

In fact, by ([23]) . p{p ij ,p' ij ) x S^Pij A \5 n \pij = S^pij, and thus by Theo- 
rem !2.2l the conditions 5% Yli<j Pij ~^ and 5^ Yl,i<j Pij = are necessary 
too for asymptotic equivalence and contiguity, respectively. (The necessity 
can also be checked by considering the total number of edges, as in the 
special case in Example 11.51 ) 

Remark 3.10. As in Example 13.91 necessity in Theorem 12.21 can in many 
cases where p\ - < p^ for all i and j (or conversely) be proved by considering 
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the total numbers Y2i Ini anci Y^i I'nv but this method does not suffice in all 
cases. A simple counter example is given by N(n) 



4 i 8 

n + rr 



Pr, 



for 



1 < i < n and p ri 



n 



for i > n 4 , and p' ni = p n i — pi;', it is easily checked 



that then (|2,7p and (|2,8p do not hold, and thus we do not have asymptotic 
equivalence or even contiguity, but, using [2, Theorems 2.M and l.C], 



d T v(Po(n 5 + n 3 ) 



Po(n 5 + n 3 - 2n 2 )) + o(l) 



0. 



4. More on asymptotic equivalence and contiguity 

We will use two metrics to measure the distance between probability 
distributions. We state some well-known definitions and facts, see e.g. [2J, 
Appendix A.l] and Chapter IV. 1 and V.4a]. 



Definition 4.1. If P and Q are two probability measures on the same 
measurable space (X,A), and R is any cr-finite measure on (X,A) such that 
P <C R and Q <C R, define the total variation distance 





dp 


dQ 


L 


dR 


dR 



dR 



(4.1) 



and the Hettinger distance 



d n (P,Q) :-- 




1/2 



dR 



[1-H(P,Q)) 1/2 (4.2 



where H(P, Q) is the Hettinger integral 
H(P,Q) :-- 



(4.3) 



f dP 
J x \dR~\ dR 

(We can, at least symbolically, write (14.3(1 as H (P, Q) := J x yJdP dQ.) Note 
that these quantities do not depend on the choice of R. (We may thus take, 
e.g., R = P + Q.) 

We have d TY (P,Q) = \ \\P-Q\\, using the standard norm on real-valued 
measures. (The factor | is conventional and convenient but unimportant, 
as is the factor | in the definition of c?h-) 

We use the same notations for two random variables X and Y with values 
in X, with dTv(X,Y) := dTy(£(X), C(Y)) and similarly for dn and H. 

(Thus d T v{X,Y) = <=^> d R {X,Y) = <=^> X = Y .) In particular, 

d TV (X,Y) := sup |P(X G A)-F{Y G A)\. (4.4) 

It is easily seen that d^y and c?h are metrics on the set of all probability 
measures on (X,A); further, < d^v < 1, < da < 1 and < H < 1, and 

dnCP, Q) < ^tv(P Q) < v^dHCP, Q); (4.5) 
hence dxv an d (iH are equivalent metrics. Furthermore, dTy(P, Q) = 1 <J=^ 
du(P, Q) = 1 <^=^ P 1 Q, i.e., P and Q are mutually singular. 
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Recall that a coupling of two random variables X and Y with values in 
the same space is a pair of random variables (X',Y'), defined together on 

the same probability space, with X' = X and Y' = Y. It is well-known that 



d TV (X,Y) 

taking the minimum over all couplings (X',Y') of X and Y. 



min F(X' + Y') 

(X',Y>) 



(4.6) 



Theorem 4.2. Let X n and Y n be random variables with values in X n . Then 
the following are equivalent. 

(i) X n * Y n . 

(ii) dTv{X n ,Y n ) -+ 0. 

(iii) d n (X n ,Y n ) -.0. 

(iv) ff(X n ,y n )-l. 

(v) There exist couplings {X' n ,Y^) of X n and Y n such that V{X' n ^ 



tion IL1| 

(USD. 



(ii) 



in 



by 63]); (iii) 



(i) 






(iv) 



by (fOl) and Defini- 

~by 
□ 

We calculate the Hellinger distance and integral for two Bernoulli distri- 
butions. (This is the origin of our function p in Definition 12.11 ) 

Lemma 4.3. For any p,q £ [0, 1], 

dn{Be(p),Be(q)) =2-V 2 p(p,q) 1 / 2 , 
H(Be(p),Be(g)) = l-\p{p,q). 

Proof. Use @2J with P = Be(p) = p<fo+(l-p)<$i, Q = Be(g) = q5 +(l-q)5 1 
and -R = 5q + $i i together with the definition (|2.ip . □ 



An important, and well-known, property of Hellinger distances and inte- 
grals is that they behave simple for product measures. Let [n] := {1, . . . , n} 
when n < oo and [oo] := N = {1, 2, . . . }. 

Lemma 4.4. Let 1 < N < oo and let, for i £ [N], Pi and Qi be probability 
measures on the same measurable space (Xi,Ai). IfP = UiLiPi arid Q = 
UtiQi, ^en H(P,Q) = l\Zi H ( p i^Qi)- 

Proof. This is stated in, e.g., [0, Proposition IV. 1.73], but for completeness 
we give the simple proof. 

If N < oo, the result is an immediate consequence of (14. 3p and Fubini's 
theorem, choosing e.g. Ri = Pi + Qi and R = Yli=i Ri- 

If N = oo, let T n be the a-field on flSi *i given by {A x A* : A € 

rj™^4j}, and let P n := P\r n an d Q n := Q\r n - Then, using the finite case, 

n n n 

H(P n ,Q n ) = H(HP l ,HQi) =Y[H(Pi,Qi). 
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Furthermore, choosing R = (P + Q)/2 on X := Uili x h dP n /dR = 
K(dP/dR | J- n ) with respect to R, so (dP n /dR) is a bounded i?-martingale 
and dP n jdR — > dP/dR R-a.s., and similarly for Q n . Hence, (|4.3j) and dom- 
inated convergence yields Q) = lim n _>oo H(P n , Q n ) = Il^i 

□ 



5. Proofs 

Proof of Theorem \2.SX (i): By Lemmas 14.41 and 14.31 

N(n) N(n) 

H(X n ,X' n ) = H H(I m ,l' m ) = II (1 - \ P (PmM))- 
1 1 

Hence, 

N(n) , N{n) 

1- \ XI p(Pni,Pni) < H ( X n,K) < expf -5 X /5(Pm,Pni)J, 
1 ^ 1 ' 

and thus H{X n ,X' n ) —>■ 1 <J=^ El PiPniiPni) ~^ 0> which yields the 
result by Theorem 14.21 

(ii): This is, in view of Lemma f4.3l and the equivalence of (|2.9h and (j2.11[) . 
a special case of flti . Theorem 1], to which we refer for a complete proof. 
Nevertheless, for completeness, we sketch a proof of the more important "if" 
direction. 

First, we can by a simpler version of the argument in the proof of Theo- 
rem 12.91 below assume that p n i < C2p' ni and q n i < C2q' ni for some constant 
C 2 . (We define p'^ by ([531) with Pni '■= Pni and use (|QjH|5l)jU Under this 
assumption, if we let P ni := C(I ni ) = Be(p n j), P n := T7j P ni , P' ni := C(I' ni ) = 
Be(p' ni ), P' n := U t PL we have by Fubini, using J (dP ni / dP^) dP' ni = 1 and 

urn 

dPn \ , ^1 1 T I ( dP n i \ ^, T T / - f ( dPri 



dPL 



JJ A + (Pni - Pni) 2 + (Pni ~ Pni? 



P'ni 1 - P'm 

< n(i + (Q. + D ( ; m ~ + y 2 + (c 2 + 1) , ) 

j \ Pni ' Pni 1 Pni T r P n j / 

<n( i+ ( c72+i ) c ^™'^)) 

< expf (C 2 + l)C^p(Pm,i 
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and thus P n < P' n , which is the same as X n <\ X' n 



□ 



We say that a finite or infinite random vectors of indicator variables X = 
(h)f = i has distribution Be(p), where p = {p{\f = i is a deterministic vector 
with elements in [0, 1], if the random variables Ii are independent indicator 
variables with Ii ~ Be(pj). 

More generally, if p = {pi}f = i is a random vector with elements in [0, 1], 
with N < oo, we say that random vectors of indicator variables X — (Jj)j =1 
has distribution Be(p) if the conditioned random vector (X | p) is a sequence 
of independent indicator variables with (/j | p) ~ Be(j?j). 

We next give two results comparing two random vectors with distributions 
Be(p) and Be(p') with deterministic p and p'. The first result is easily seen 
to be equivalent to the "if" direction of Theorem I2.2( i) , while the second is 
equivalent to a special case of the "if" direction of Theorem I2.2f ii) . 

Lemma 5.1. For every e > 0, there exists 5 > such that if X ~ Be(p) 

and X' ~ Be(p') for two deterministic vectors p = {p{\f = i and p' = 

of the same length N < oo, and these satisfy Y2i=x PiPiiPi) < then 

d TV (X,X')<e. 

Proof. Suppose not. Then there exist e > and such random vectors 

X n ~ Be(p n ) and X' n ~ Be(p^) such that Y^x P(Pni,p' ni ) < l/ n but 
d"[\i{X n ,X' n ) > s, but this contradicts Theorems I2.2( i) and 14.21 □ 

Lemma 5.2. For every constants C\,Ci and e > 0, there exists 5 > 
such that if X ~ Be(p) and X' ~ Be(p') for two deterministic vectors 
P = {Pi}f=x ana " P' = Wl£i °f the same length N < oo, and these satisfy 
Yi=x PiPhP'i) — Cx and further, for every i G [N], pi < C2p\ and (1 — Pi) < 
C 2 (l-p-), then for every set A with F(X' £ A) < 5, we have F(X G A) < e. 

Proof. If not, it would be possible to find, for some fixed C%, C2 and e, 
sequences X n and X' n of such random vectors and sets A n such that F{X' n G 
A^) < 1/n and ¥(X n G A n ) > e. In particular, X n ^ X' n . 

On the other hand, (|2.8p and (|2.9p hold for these random vectors (since 
the sums in (|2.9[) vanish for any C > C2), and thus Theorem I2.2( ii) yields 
X n <\ X' n , which is a contradiction. □ 



Proof of Theorem \2.9l (i): Let e > and choose 5 > as in Lemma 15.11 
Then, by Lemma 15.11 applied to the conditioned variables {X n \ p n ) and 
( x n I Pn)> if HiP(Pni,p' ni ) < then d T v{(X n I p n ), {X' n I p'J) < e. Since 

P(X n eA)- F(X' n g A) = E(P(X n eA\ Pn )- F(X' n g A | p^)) 
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for every measurable A C X n = {0, \} N{ - n \ it follows that 
d TV (X n ,X' n ) < Ed TY ((X n | Pn ),(X' n | p'J) <e + F(j2p(PnUPni) > $ 

i 

The latter probability tends to by assumption, and since e is arbitrary, 
this yields dTy(X n , X' n ) — > 0. 

(ii): Let (A n ) n be an arbitrary sequence measurable sets with A n C X n = 
{0, \} N{ ^ and let e > 0. 

By (12. 14p . there exists Ci such that IP(Ei p(p™,Ku) > Ci) < e for all n. 
Similarly, by (I2.15p . there exist C2 > 1 and no such that for n > no, 

X! Pm + E Qui > £ J < e; 

i:p n i>C'2p' ni i-qni>C2q' ni / 

in the sequel we consider only n > no- 
Define p» = {p'^j? by 

Pn> Ei PiPnUP'ni) > Cl or Ei:p nl >C 2 p^ Pni + Ei^X?^ 9ni > £] 

p n , otherwise. 

By our choices of C\ and C2, F(p^ 7^ p n ) < 2e, and we may thus define 
K = ~ Be(p^) such that 

P(X£ ^ X n ) < P(p^ + p n ) < 2e. (5.1) 
Moreover, by the construction, with q'^ := 1 — p" ni , 

^2p(PnvPni) <Cl and X] ^"* + S ^ni<£- (5-2) 
Next, define = by 

Ptrf> Pni > C 2Pni ° r ?ni > C 2Q n i'> 

p" otherwise. 



Pn : 



We can construct X'" ~ Be(p^') using maximal couplings of (Z^ | p"') and 
(J^ I p n ) so that, using O and (P , 



d T v(« I p£'), (X;' I p£)) < E I PS). tf* I J&)) 

i 

— / j \Pni Pni\ 



< E Pni+ E «^ B - ( 5 - 4 ) 

Consequently, 

drv < E d T V ( W I Pn ), (*n I Pn)) < e. (5.5) 
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Furthermore, by flO}, p% < C 2 p' ni and q% : = 1 - j% < C 2 q' ni and by 
EB and (El , 



E^xo^E^"-^)^- 

We can thus apply Lemma I5T21 to the conditioned variables {X'" | p"') and 
(X' n | p^) for all values of p"' and p' n . Consequently there exists 5 > such 
that if ¥(X' n G 4 | p' n ) < 5, then F(X% G 4 | p£') < e. Hence, using 
Markov's inequality, 

P«" G 40 =EP(X;" G A n | p£') < e + P(Pp£ G ^ n | p^) > 5) 
<e + r 1 KP(X' n eA n \p' n ) = e + r 1 P« G A n ). 

Using ()5.ip and (|5.5H . we thus obtain 

P(^n G 4,0 < P(X n ^ X%) + (kviK, K) + nX'n £ 4.) 

< 4e + ^ 1 P(X; G 40. 

If we assume that P{X' n G 40 ~ * °> ^ follows that limsupP(X n G 40 < 4e, 
and since e is arbitrary thus P(X n G 4) - ► 0, which shows that X n <\ 
X' n . □ 

Proof of Corollary '[2JM In order to apply Theorem 12.91 we reorder {p%j}i<j 

to {Pni}f=i j we do this without further comment. We also let q^j := 1 —p%j 
and := 1 - 

(i) : By ([231) . whp p(pij,Pij) < C (pij - p'ij) 2 /pij for some C , and thus 
(|2.17p implies (|2.13p . and the conclusion follows by Theorem 12. 9( i). 

(ii) : Similarly, by (|2.5f) again, ()2. 18f) implies (I2.14|) . Moreover, for any 
C7>2, 

. PiJ - c . h p» Pij ■ (5 - 6) 

i:p' l:j >Cpij t:Py>Cp« J ' ' 

Hence, for any sequence C n — > oo, (|2.18p implies that X^y. .>C n vu ~ ~~ * ®' 
Moreover, for any C > 10, Cg^ > 1 and thus q'^ < Cqij for all i,j. It 
follows that (|2.16p with p and p' interchanged holds, and thus (|2.15p with 
p and p' interchanged holds. (The latter is also easily proved directly using 
(15. 6p .) Consequently, Theorem 12.9( h) yields G(n,p') < G(n,p). 

(iii) : The extra assumptions allow us to interchange p and p' in the 
assumptions. Hence (ii) yields both G(n, p) > G(n, p') and G(n, p') > 
G(n,p). □ 

Proof of Corollary \2. 131 An immediate consequence of Corollary 12. 121 since 
now (p^ —p'ij) 2 /pij = 0(pfj); note also that the assumption in (i) implies 
maxijpij = Op(l) and thus maXijPij < 0.9 whp. □ 
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